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We review recent results that provide a new approach to the old problem of naturalness in super- 
symmetric models, without relying on subjective definitions for the fine-tuning associated with 
fixing the EW scale (to its measured value) in the presence of quantum corrections. The approach 
can address in a model-independent way many questions related to this problem. The results 
show that naturalness and its measure (fine-tuning) are an intrinsic part of the likelihood to fit the 
data that includes the EW scale. One important consequence is that the additional constraint of 
fixing the EW scale, usually not imposed in the data fits of the models, impacts on their overall 
likelihood to fit the data (or X 2 / n df, n df number of degrees of freedom). This result has nega- 
tive implications for the viability of currently popular supersymmetric extensions of the Standard 
Model. 
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A new approach to Naturalness. 



1. Naturalness: many questions, few answers. 

The purpose of this talk is to provide a new, different perspective to the old problem of natu- 
ralness in particle theory, based on recent results [1,2] that we review here. As pointed out long 
ago [3], in order to fix the electroweak (EW) scale in the Standard Model (SM) to its current value, 
in the presence of quantum corrections, a tremendous amount of "fine" tuning of parameters is 
required (roughly 1 part in 10 33 ), and this is thought to be unnatural. This tuning is just another 
face of the large hierarchy between the EW and the Planck scales and indicates when a model is 
unnatural, assuming its validity up to the Planck scale. However, "new physics" can enter at some 
lower scale. For example, additional symmetries can naturally explain such hierarchies [4]. One 
such possibility is to build models based on scale (conformal) symmetry [5], not presented here. 
Another possibility is to use low-energy supersymmetry, which is the case discussed below. With 
supersymmetry broken at a scale m msy above few TeV, there is however a "remnant" of the fine- 
tuning mentioned in the SM that worsens as m susy is increased, due to negative SUSY searches. So 
the level of fine tuning became a "measure" of the success of SUSY as a natural solution to the 
hierarchy problem i.e. a measure of the naturalness of a model. 

The problem is that we do not have a widely accepted definition for fine tuning A and criteria 
to decide when a model is natural. This brings many open questions. The first definition was [6] 
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dim 



dlnji 



7={m ,m 1/2 ,Ao,So,Aio---}- (1.1) 



where /; are components of the set y of SUS Y-related parameters that define the model and v is the 
EW scale (higgs vev). This definition is motivated from the physics point of view, and measures the 
stability of the EW scale at the quantum level under small variations of y,-, after SUSY is broken. 
It was used in many tests for the naturalness of the models. But an immediate question arises: A 
depends on parameters Ji, so should these parameters include only those that SUSY introduces or 
also some non-SUSY ones as well, like nuisance variables (e.g. Yukawa couplings)? For example 
top Yukawa coupling could bring the dominant contribution to A, so should we include y t on the list 
of parameters in (1.1)? Further, one can also use other definitions for A, see [7], or more recently 
[8], or for example: 

v-, / <9lnv \ 2 ~i V 2 , , 

L(^]— J j » r={mo,m l/2 ,Ao,B ,Ho---}. (1.2) 

Do all these different definitions for A lead to similar conclusions for the viable regions of the 
parameter space of the model? what is the right definition of fine tuning? Another drawback is that 
A provides a local measure in the y space of the quantum cancellations that fix the EW scale, while 
to compare models, one could say that a more global measure is desirable. How do we compare 
models of different A, over the entire parameter space? 

Even if all theorists agree on a particular definition for A, then what A is acceptable for a 
model to be considered natural! In principle we want A be small, so the EW scale is stable when 
varying y, in the presence of the radiative corrections; but is A = 100 (i.e. tuning of 1 part in 100) 
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acceptable? or A = 10000? If no sign of TeV-scale supersymmetry is found at the LHC, is there a 
value of A for which we should give up a model based on low-energy supersymmetry? 

Further questions arise when comparing two models, as discussed in the following. Firstly, for 
a supersymmetric model that can fit the experimental data represented by a set of observables 0? x 
of theoretical values 0- h , one defines 

A good x 2 fit requires that (ridf denotes the number of degrees of freedom) 

X 2 / n df ~ 1, where n df = n -n p . (1.4) 

with no the number of observables fitted and n p the number of parameters of the model (yt, etc). 

Suppose now that we have two models A and B: model A has a very good fit X 2 / n df ~ 1 but 
fine-tuning A (according to some definition) of order ^(1000). So the model is likely but rather 
...unnatural! Consider that model B has a good X 2 / n df but slightly worse compared to model A, 
but its fine tuning much improved ^(10) (according to the same definition). How do we decide 
which model of the two is better? This situation is confusing, more so since current data fits indeed 
report separately the value of X 2 / n df and the value of A. Also the possibility of having a realistic 
model (i.e. good likelihood) that is unnatural, or a natural one but unrealistic reflects a puzzling 
situation. We would like to answer these questions, but since we cannot even agree on a definition 
for A, how to address them? 

It is then highly desirable not to rely on any definition for fine-tuning and to use more funda- 
mental arguments (and eventually derive a measure for naturalness from these). The first step to 
understand what happens is this: one tunes the parameters j{ to fit the observables. This brings a 
X 2 1 ndf "cost". One must also tune the same parameters /; to fix the EW scale, which is central to 
the issue of fine tuning A. Clearly, there is no technical difference between the two tunings. That 
means that total x 2 and A must be technically related. But what is their relation? 

Let us think from a different perspective and provide the central idea. While we cannot agree 
on a definition for fine-tuning (naturalness) in a model, we do know if a model is realistic or not: we 
test it against experimental data. We do so by computing the likelihood to fit the data (frequentist 
approach) or the Bayesian probability of the model. These are well defined mathematical (proba- 
bilistic) tools used to test a model. Next, recall the original goal of SUSY, central to the issue of 
naturalness: fixing the EW scale (niz) to its measured value, in the presence of quantum corrections. 
If we regard this as a constraint and impose it on the current likelihood to fit the data, the associated 
physical problem of naturalness (fine-tuning) should be captured by the mathematics that describes 
this constraint! So one should compute the likelihood to fit the data with this constraint imposed! 
Surprisingly, this is something not yet done by the current methods that compute the likelihood (or 
X 2 ) to fit the data in SUSY models. This is the central idea that we follow below. 

Without assuming any definition for fine-tuning or reference to this concept, we show how 
the constraint of fixing the EW scale automatically leads to a correction factor that impacts on 
(worsens) the current likelihood to fit the data. Once this mathematical relation is established one 
can answer, in a model-independent way and on probabilistic grounds, the questions listed above. 



A new approach to Naturalness. 



2. A new approach: how "constrained" likelihood accounts for naturalness. 

Let us then compute the likelihood to fit the data in a SUSY model under the mathematical 
constraint of fixing the electroweak scale to its measured value, in the presence of quantum cor- 
rections. The constraint encodes the physics behind the naturalness problem, and leads to a new 
perspective on this issue. Firstly note that in accurate data fits one usually performs a fit with the 
following observables (in a standard notation): 

sin 2 e^., T z , 8a^ m h ,m w ,Cl DM ,BR(B^X s y), BR(B s ^n + n~), BR(B u ^<uv), AM Bs . (2.1) 

However, fixing the EW scale to its accurately measured value (m^), is not on this list, so the current 
likelihood tests of supersymmetric models do not account for this effect (constraint). 

For each observable 0, denote the corresponding probability P(Oj\y,y), which is usually taken 
a Gaussian. P(0{\y,y) depends on parameters y= {71,72,....} introduced by SUSY and on other 

parameters too, like Yukawa couplings y = {y t ,yb, } that denote nuisance variables. These are 

variables that can be removed from the total likelihood by either maximizing it wrt to them (for 7 
fixed) or by integrating over them wrt some measure. Since observables 0, are independent, one 
multiplies their probability distributions to obtain a total distribution j&f(data|y,;y) where "data" 
stands for all observables. With "data" fixed, «Sf is a function of the sets 7, y only, so we denote it 
as J*f (data|7,y) = L(y,y), with 

L(Y,y) = ]JP(Oj\y,yy, y={mo,m 1/2 ^o,m ,A ,B ,---}; y={y t ,yb,~-} ( 2 - 2 ) 
j 

One then computes the corresponding % 2 = — 21nL, by seeking a region of the parameter space 
where X 2 / n df ~ 1 at the minimum, by tuning y ; - and y^ to fit observables Oj. If one remains true 
to the original goal of SUSY, one should impose on L the additional constraint of fixing the EW 
scale to the measured value m| and analyze its impact on L. This is what we shall do below. For 
technical details see the original papers [1]. 

We start with the scalar potential of supersymmetric models (#1,2 higgs doublets): 

V = m\\H l \ 2 W 2 \H2\ 2 -{m]H l .H2 + hx.) + {h/2)\H l \ A +{fa/2)\H2\ 4 +h\Hi\ 2 \H2f 
+ h\H i .H 2 \ 2 +[(X 5 /2)(H i .H2) 2 + h\H i \ 2 (H l .H2)+X 7 \H2\ 2 (H 1 .H 2 )+h.c.] (2.3) 

Some Xj are non-zero only at quantum level (considered here). Introduce effective couplings A, m 

X = Ai/2cos 4 J 8+A2/2sin 4 /3 + (A3 + A4 + A5)/4sin 2 2 J 8 + (A 6 cos 2 J 8+A 7 sin 2 j 8); 
m 2 = ra 2 cos 2 /3 + m 2 ; sin 2 j8 — m 2 sin 2/3, (2.4) 

We also denote v 2 = {h®} 2 + (h®) 2 . Then the EW minimum conditions determine v and tan/3: 

v-(-m 2 /A) 1/2 = 0, and tanj3 -tanj3 (7,y,v) = 0. (2.5) 
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where j3o is the root of the second minimum condition. Let us denote 

v(y,y,v,j3) = (-m 2 /A) 1/2 (2.6) 

which at the quantum level depends on the arguments shown. 

The "constrained" likelihood, hereafter denoted L w , to fit the data and also to fix the elec- 
troweak scale to its value (m z ) can be written in terms of the usual likelihood L to fit observables 
other than the EW scale itself, as follows: 



Lw(7,y)= ™z] dvd{t^)8[v-(-m 2 /X) l l 2 } 5[tanj8-tanj3o(y,y,v)] 8{m z -m z ) L(y,y,v,P) 
= vo 8[v -v(Y,y,v ,po(Y,y,vo)] £(y,y,vo,j3o(y,y,v )) 



VQ 

IVv 



-8[ni{lnzi-lnz°)]L(Y,y,v ,po(Y,y,vo)), with z.i = {Yj,yk}- (2.7) 

o 



with a sum over repeated index i; m z in front of the integral is a normalization factor and 

m z = (g 2 +g 2 2 y/ 2 v/2, m z = (g 2 +g 2 y/ 2 v Q /2, v = 246GeV (2.8) 

niz is the theoretical mass of Z boson, m z is its measured value (« 91.2 GeV) and g\,gi are gauge 
couplings of U(l), SU(2). Under integral (2.7) two Dirac delta functions impose EW minimum 
conditions (2.5). In the presence of these functions, the integrals over v, tan/3 are just a formal 
way to say that we solved the constraints of the EW minimum. Actually, the integral over v can be 
regarded as an integral over m z « v which is an observable of probability distribution 8(m z — m z ); 
the role of this delta function is to fix the EW scale to the measured value. Using 8{m z — m z ) is 
justified since niz is accurately measured. In the last step in (2.7) a Taylor expansion of v near vo 
was done, and n, are components of the normal to the surface: vo — v(y,y, vq, j3o(y,;y, vo)) = which 
has the solution Yj = Yj,yk = y%- So rc,- = (<9,v/|Vv|) where subscript "o" indicates evaluation at 
Yj> y°k- |V(...)] denotes the gradient (in parameter space lny ; -, lnj^), evaluated again at y?, y Q k [1]. 

From eq.(2.7), we conclude that the "constrained" likelihood L w is nonzero if Yj = 7? , Jfc = y®- 
These parameters are correlated by vo — v(y ,j ,vo,j8o(y°,3' ,vo)) = so one could eliminate one 
of them (usually chosen to be Ho). With this in mind, the result of the last equation becomes: 

A fy (r,j u ) 

where A q = \ Vlnv| , giving 

1 zi={7j,yk} ' J 

where notice that the sum over Zi extends over both Y\ , Jl , ■ ■ ■ ■ and y t , yi, , . . . . parameters ' . 

'in the last step in eq.(2.7) we used lnz,- as fundamental variables instead of z,-, where Zi = f\_, Ji, ...,Yn',yt,ybi — j this 
ensures dimensionless arguments under the last Dirac delta function. This equation can however be written in terms of 
Yj and y^ after prior normalization of Yj to some scale (like vo, etc), to ensure they are dimensionless. In this case the 
expression of An in eq. (2.10) is changed by replacing ln^ —¥ Yi/ v an d m >'/t -*yjfc. 
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The result in (2.9) shows that to maximize L w one should maximize not the traditional like- 
lihood L (as done in present data fits), but its ratio to the quantity A q that emerged above. Points 
in the parameter space with large A q can reduce the overall L w . The presence of A q in the con- 
strained L w is entirely due to imposing on the traditional likelihood L, of the constraint of fixing 
the EW scale in the presence of quantum corrections. Since this constraint encodes the physics of 
the naturalness problem, the quantity A q in (2.9) can only be interpreted as a model-independent 
measure of naturalness i.e. fine-tuning. It is for this reason that in the following we often refer 
to A q as "fine-tuning". We stress that A q is a quantity that was derived when computing L w , and 
not an ad-hoc definition! A q of (2.9) depends on SUSY parameters y, and on nuisance variables 
like Yukawa couplings y^. This answers the question on the parameters with respect to which one 
should actually compute fine-tuning. 

Further, using common assumptions such as Gaussian distributions for the observables, then 
X 1 = — InL, in which case the relation in (2.9) can be rewritten as 

xl = X 2 + 2\nA q . (2.11) 

A SUSY model that fits well the data and also fixes the EW scale must have total Xw/ n df ~ 1< 
Until now one would only try to satisfy the weaker condition % 2 l n df ~ 1 associated with fixing 
observables other than mz, and which is not easily satisfied in many SUSY models. 

Relation (2.11) for x 2 makes no distinction between tuning parameters to fit the EW scale 
or some other observable. Indeed, fixing the EW scale brings a x 2 "cost" (21nA ? ) for the model 
when A q > 1, just like any other observable. Eq.(2.1 1) also shows that we should actually minimize 
X^, i.e. the sum of two contributions. A small A q is preferable, in agreement with the traditional 
physical intuition (although it does not have to be minimized on its own). Finally, eq.(2. 11) answers 
the old question of how to compare two models: one with "good" X 2 / n df but "bad" fine-tuning 
and another with "not as good" x 2 '/ n df but "not as bad" fine-tuning: one should simply minimize 
and compare their total X 2 / n df which encodes the information of their likelihood to fit the data 
that includes the EW scale. In brief, naturalness is an intrinsic part of the likelihood to fit the data. 

We can also derive a criterion for what is acceptable fine tuning in a SUSY model: we demand 
a good fit for the constrained likelihood, i.e. Xw/ n df ~ 1 or so. This requires that 

A, < exp(n d/ /2) (2.12) 

which should be comfortably respected (given the extra # 2 -"cosf" of other observables). We thus 
have an answer to the question regarding an upper bound for fine tuning, beyond which the model 
may not be regarded as "natural" anymore. This bound is general and independent of the model 
details other than its number of degrees of freedom n<//. In many popular SUSY models tidf is of 
order 10, meaning that an acceptable upper bound is somewhere in the region A q <C exp(5) « 150. 
What happens if A q > exp(«rf//2)? In that case Xw/ n df becomes larger than unity even before 
fixing observables other than the EW scale itself! Obviously, for a given value of Xw/ n df such as 2 
or larger we have a poor fit and one is unable to fit the data and (fix) the EW scale at the same time. 
Then the model can be ruled out probabilistically. 
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The relation of %i- to A q provided a probabilistic interpretation for 2 A q . This helps us appreciate 
better the significance of a given value of A q . Suppose we have A q = 10 6 so # 2 receives a correction 
from fixing the EW scale equal to 21nl0 6 « 27.6. What does this mean? From eq.(1.3), assuming 
that one particular observable is m 5.25a deviations from its central value would bring a coiTection 
to total Xw that is similar to that of A q = 10 6 . By this argument we have the correspondence 
A q « 10 6 o 5.25a, and similarly A q « 1000 f* 3.5a and A ? « 100 «->■ 3a, and so on. To see 
this in a broader context, remember that currently the disagreement between the muon anomalous 
magnetic moment in theory versus experiment is about 3 a deviations [10]. According to the above 
correspondence this is similar to having A^ ss 100. We shall discuss shortly the numerical values 
of A q in popular SUSY models. 

3. How the Bayesian approach accounts for naturalness. 

Here we show how to extend our previous results to the Bayesian approach to fit the data, 
based on [2] (section 2) reviewed here. In this approach, one computes the Bayesian probability 
("evidence") ;?(data) of a model, given the data. Here p(data) is an integral over the parameter 
space Yt (and eventually y^ too) of the product between the likelihood L and the initial probability 
distributions p(y) ("priors") for the parameters 

p(data) = jdy l ....dy n p(y) L(y,y, v,j3) (3.1) 

This result for /?(data) is used to compare the relative probability (viability) of different models. 

To extend our results we simply use (3.1) with the replacement L — > L w . Eq.(2.7) that imposes 
the constraint of fixing the EW scale, is multiplied by p{y) and integrated over /;. The result is 3 



p(data) = m| I dy\....dy n p(y) dv d (tan J3) 

x S[v-v(y,y,v,p)] 5[tanj3-tanj3 (y,y,v)] 8(m z -m° z ) L(y,y,v,p) (3.3) 

= v J dYi-.dY n p(Y,y) %o-v(y,y,vo,j3 (y,y,vo))] L (Y,y,vo,Po(Y,y,vo)) 

= dS]a rw — r^\ L {r^ v o^o(7,y)) 7i-7n p{7), E: v = v(7,y,vo,j3o(y,;y,vo)) 

This is just a global version in parameter space of the local result in (2.7), (2.9); in deriving 
this result, the priors are spectators. The integral above is over the surface £ defined by vo — 
v(7,y,vo,j8o(7,y,vo)) = 0. Note the emergence of the suppression factor |Vi nr lnv| = A q where 
A^ is that given by eq.(2. 10) but with the sum restricted to y parameters (Vi„y denotes the gradi- 
ent computed wrt lny,- variables). If one integrated (3.3) over Yukawa (nuisance) parameters too, 



2 A probabilistic interpretation for the inverse of A was suggested, on physical grounds, in [9]. 
3 One can also use that (the surface integral is over S defined by g = 0): 

/ f{zi,—,Zn)8(g(zi,...,Zn))dzi....dZn= dS n _if{z\ : ...,Z n ) 7= r (3.2) 

JR" JSn-i |V-.g| 
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then A q is exactly that of eq.(2.10) [2]. The 1/A g factor under the integral of /?(data) in the last 
line of (3.3) acts as an extra prior and is solely due to the constraint of fixing the EW scale via 
8(mz —m^), as also seen in the local case. By comparing this result for /?(data), evaluated nu- 
merically for different models, one decides which model is more likely, while taking into account 
the effect of 1/A^ associated with naturalness. Finally, eq.(3.3) simplifies further for logarithmic 
priors, p(y) = 1/(7172. ...y„). 

Let us mention an equivalent form of the result in (3.3) which is 

p(data) = / dSy L[7,v,vo,j8o(7,v,v )] p(y) (3.4) 

Jz |Vylnv| 

where the gradient V r and dSy are defined with respect to parameters 7 instead of In 7 used in 
(3.3). In this case A q changes accordingly, A q = | V r lnv|. 

Unlike the "frequentist" approach discussed in the previous section, current data fits in the 
Bayesian approach already include the "cost" of naturalness in their results. Early attempts were 
based on a choice of priors that go like 1 /A where A was an ad-hoc definition for fine-tuning wrt a 
given parameter [11]. That meant that points that were badly fine-tuned wrt that particular choice 
of fine-tuning, were giving a smaller contribution to global p (data). More recently, a so-called 
"Jacobian factor" was identified to account for a similar effect [12, 13], starting from assumptions 
similar to those used here (see [13] for more details). It would be interesting to establish the exact 
relation of this Jacobian factor to the factor l/A q that we identified above to account for naturalness. 
This ends our discussion on the Bayesian approach and on how A q emerges in this case. 

4. The minimal value of A q in supersymmetric models and its impact on Xw/ n df> 

The next step in this analysis is to answer what values A q of eq.(2. 10) takes in current SUSY 
models. We then examine the impact it has on total %w/ n df, see eq.(2. 11). Can one satisfy condition 
(2.12)? To this purposes we use the results in [2], For related studies see [15, 16, 17, 19, 20, 21, 22] 
and for recent data fits in SUSY models see [17, 18]. We use eq.(2.10) evaluated wrt to the SUSY 
parameters 7 only, listed below for each model considered; this restriction underestimates A q , so 
from this point of view our analysis is conservative. The models considered here are [23] : 

• the constrained MSSM model (CMSSM), of parameters yj = {mo,mi/2,Ho,Ao,Bo}, in a standard 
notation for soft masses of squarks and sleptons (mo), gaugino (mm), usual p. term (po), trilinear 
(Aq) and bilinear (Bo) SUSY-breaking terms. 

• the NUHM1 model: this is a CMSSM-like model but with Higgs masses in the ultraviolet (uv) 
different from mo, m'£' = m™ 7^ mo, with parameters yj = {mo,mi/2,HQ,Ao,Bo,mf}. 

• the NUHM2 model: this is a CMSSM-like model with non-universal Higgs mass, m^/m^/mo, 
with independent parameters yj = {mo,mi/2,jUo,^o,Bo,m£ V ,mj[ v }. 

• the NUGM model: this is a CMSSM-like model with non-universal gaugino masses mi., i = 
1,2,3, with 7/ = {mo, Mo,Ao, Bo, m Al , m^,mx 3 }. 

• the NMSSM model: compared to the CMSSM, in this case there is an additional gauge singlet, 
so yj = {mo,m 1 i 2 ,Po,Ao,BQ,m l / 2 ,rns} where m$ is the singlet soft mass. 

8 
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Figure 1: A q versus M^gg S in various models, from ref.[2]; lightest grey (0) area: excluded by SUSY mass 
bounds; darker grey (1): excluded by b — >• sy, B — > fi + ji~ , 8p; dark grey (2): excluded by condition 8a^ > 0. 
Area inside the closed contour: allowed by data and with 2 a deviation of g — 2: 8a^ < (25.5+2x8)10 -10 ; 
Sa™" x is shown colour encoded. Area outside closed contour: 8a"™ x < (25.5 — 2x8)10~ 10 (2<7). Only in the 
NUGM is one close to satisfying the g—2 constraint within 2a, not too surprising given its non-universal 
gaugino masses. In all plots the dark matter relic density was computed [14] and can be fitted within 3(7 [2], 
For similar plots in NMSSM or GNMSSM see [15] while for the GNMSSM in the limit of a massive gauge 
singlet (integrated out) see plots in [16]. 



• the general NMSSM model (GNMSSM): this is an extension of the NMSSM with a bilinear 
superpotential term, MS 2 , where S is the singlet superfield. So we have one more parameter (M) in 
addition to the parameters of the NMSSM. 

In all models a complete scan over the entire parameter space jj is performed [2] including tan/3 
with standard experimental constraints 4 . The result for A q (computed at two-lop using SOFTSUSY 
[24]), is plotted in Figure 1 as a function of the SM-like higgs mass mi, (computed at two-loop 
leading log), for each model, except the NMSSM, GNMSSM cases. Notice the log scale in the 
plots (on OY axis), indicating an approximately exponential increase of A^ wrt m/ 2 [19]. We find it 
interesting that if one also computed A defined by (1.1), the plots have an identical form to those 
for A q , but only slightly shifted along OY axis, to a lower A by a factor between 1 and 2. 

From Figure 1, after fixing the higgs mass to its measured value m/, ~ 126 GeV at the LHC 
[25] one immediately identifies in each model the minimal value of A q obtained after scanning the 
whole parameter space. The results are summarized in Table 1 for minimal A q and its corresponding 
8% 2 = 2\nA q contribution, for m/, near the pre-LHC bound (115 GeV) and for the current value 
(126 GeV). The results in this table include additional numerical results for the NMSSM and also 
GNMSSM models for which we did not display the plots of A q as a function of m/,, but used instead 



4 For the experimental data used see table 1 of [2], The muon 8a ^ is not imposed but shown separately in Figures 1. 
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Model 


tip 


Approx 


Aq(U5) 


5^ 2 (115) 


A ? (126) 


<5Z 2 (126) 


n df 


^/iid/(i26) 


CMSSM 


5 


2-loop 


15 


5.42 


1800 


14.99 


9 


2.66 


NUHM1 


6 


2-loop 


100 


9.21 


1500 


14.63 


8 


2.83 


NUHM2 


7 


2-loop 


85 


8.89 


1300 


14.34 


7 


3.05 


NUGM 


7 


2-loop 


15 


5.42 


1000 


13.82 


7 


2.97 


NMSSM 


6 


1-loop 


12 


4.97 


>200 


10.59 


8 


2.32 


GNMSSM 


7 


1-loop 


12 


4.97 


27 


6.59 


7 


1.94 



Table 1: The correction <5j 2 = 21n A 9 (y), the number of parameters n p and degrees of freedom n c /f in SUSY 
models, for m/, m 1 15 GeV corresponding to the pre-LHC bound and for ;«/, « 126 GeV. For 1 15 GeV, one 
had 8% 2 /n c if < 1, thus leaving the possibility of having Xw/ n df ~ !• F° r tne 126 GeV value, 8% 2 /ndf > 1 
in all models above, except the GNMSSM where this is close to unity. This means that the requirement 
of a good fit (Xw/ n df ~ 1) is already violated by the fixing of the EW scale alone, before considering the 
additional % 2 cost of other observables. Assuming these give the usual X 2 / n df ~ 1 i- e - a good fit, adding the 
impact of 8% 2 leads to the total Xwl n df displayed in the last column, tidf may vary, depending on the exact 
number of observables fitted. The numerical values of A q are from [2] (first 4 models) and correspond to the 
plots in Figure 1, while for GNMSSM and NMSSM we used the results in [15] and [17]. 

the results of [15], [17]. Although not shown in the table, for all models 8% 2 shown varies by w 1 
for a similar sign variation of the higgs mass by 1 GeV. 

The results in Table 1 show that A q has a significant impact on the overall quality of the 
fit (i.e. the value of %w/ n df) since it brings corrections 8% 2 /ndf > 1 for models other than the 
GNMSSM. This means that the requirement of a good fit {% 2 / n df ~ 1) is already violated by 
the fixing of the EW scale alone, before considering the additional % 2 cost of other observables. 
So naturalness, encoded by the constraint of fixing the EW scale, has a very significant % 2 cost. 
Assuming (ideally) a good fit under current experimental constraints, i.e. X 2 l n df ~ 1, one finds 
that overall Xw/ n df > 2.3 in the models discussed. An exception is the GNMSSM model, where 
this result is improved mildly (« 1.94). 

There is another interesting aspect in Table 1. While in some models A q at 126 GeV decreases 
relative to the CMSSM case, the correction 8% 2 /ridf and total %w/ n df can actually increase (!), 
because of the stronger effect of decreasing ndf (larger n p ). Thus, unless it is very significant, 
a reduction of A q due to new physics (more parameters) does not necessarily lead to a reduction 
of Xw/ n df and a better fit. In GNMSSM the reduction of A q is significant enough and Xw/ n df is 
improved relative to the CMSSM (for a recent GNMSSM study see [26]). 

5. Conclusions 

Our conclusions are best summarized by listing the old, unanswered questions in the introduc- 
tion on the problem of fine-tuning in SUSY models, together with the answers that we found. 

• Is there a relation between naturalness and the likelihood to fit the data, and if so, can we derive, 
on mathematical grounds, an expression for fine-tuning? 
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To answer this, one should recall the original goal of SUSY, central to the physical problem of 
naturalness (fine-tuning): to^k the EW scale to its measured value, in the presence of quantum 
corrections (while assuming the model is valid up to the Planck scale). If we regard this as a 
constraint and impose it on the current likelihood to fit the data the associated physical problem of 
naturalness (fine-tuning) should be captured by the mathematics that describes the constraint. We 
computed the "constrained" likelihood L w by imposing this constraint on the likelihood L to fit the 
data (other than the EW scale) and found that L w = L/A g . So one should maximize L w , not L. The 
factor A^ is due to this constraint alone, so it encodes the mathematical effect of the naturalness; 
thus, it can only be regarded as a derived fine-tuning expression. Rather surprisingly, current data 
fits (in the frequentist approach) do no account for this (suppressing) effect on the likelihood, thus 
leading to more optimistic results. 

• What are the parameters with respect to which A q should be computed? 

In the literature it is often unclear whether to include Yukawa couplings among these parameters. 
If one computes the constrained likelihood L w as explained, there is no choice: these parameters 
include both the SUSY parameters and the nuisance variables (Yukawa couplings, etc) as well. One 
can maximize L w wrt Yukawa (SUSY parameters fixed) and thus eliminate them. Alternatively, one 
can marginalize L w over nuisance variables in which case A q is computed with respect to the SUSY 
parameters only. 

• What is an "acceptable" value for A^ identified above? 

The relation between the likelihoods L w = L/A q , has a correspondent in x 2 = X 2 + 21nA^ (assum- 
ing x 2 = — 21nL). So total x 2 accounts for tuning the parameters to fix both the usual observables 
and the EW scale, and it makes no distinction between these tunings which are treated on equal 
footing, as it should be the case. In particular, electroweak scale ("fine") tuning is nothing but x 2 
"cost"! In other words, naturalness is an intrinsic part of the likelihood to fit the data that includes 
the EW scale. For a good fit one should then have a total Xw/ n df ~ 1 where n^f is the number of 
degrees of freedom. This implies a necessary bound A q < exp(n^/2), which should be comfort- 
ably respected (i.e. A q <C exp(n^//2), since often % 2 l n df is itself close to unity). For the popular 
models discussed this means A <C 90 (n^f < 9). 

• Can we rule out supersymmetric models based on the size of the quantity A q alone? 

The answer is yes; when A q S> exp(n^//2) then the total Xw/ n df is so poor that the model cannot fit 
the data anymore and is ruled out. Note however that the addition of new physics, that can actually 
be beyond the LHC reach, could reduce A q , as seen for the "general NMSSM" (GNMSSM) model. 

• Does A q have a physical meaning? 

The relation %t = X +2\aA q gives A q a probabilistic interpretation. A value of say A^ = 10 6 has 
an effect on x 2 similar to one observable that contributes to x 2 being « 5.25a deviations from the 
central value. By the same rule, A q = 100 is equivalent to a deviation of ps 3a, which is the order 
of deviation of the muon magnetic moment. 

• How can we compare two models and decide which is better: one that has a very good x 2 fit in 
the traditional sense, but significant fine tuning and one that has a nearly-as-good x 2 fit but less 
fine-tuning ? 
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The question arises since accurate data fits usually report separately X 2 l n df and fine tuning A com- 
puted according to some definition. We showed that to compare such models, one should actually 
compare their total Xw/ n dfi and this answers the question above. This is the 'frequentist' case. 
In the Bayesian approach one compares instead the "constrained" probabilities of the two models 
/?(data) that contain an additional l/A q factor under their integral. This factor again emerges (as 
an extra prior) from the naturalness constraint of fixing the EW scale at quantum level. 

• How can one reduce A q and also Xw/ n df to improve the data fits of SUSY models? 

The easiest way to reduce A q is to increase the effective quartic higgs coupling A, whose small 
value is at the origin of large A q in most SUSY models 5 . One can increase A by new physics (su- 
persymmetric!) beyond the MSSM higgs sector [27] and possibly beyond the LHC reach too! The 
new physics can be represented by additional massive states that couple to the higgs sector: sin- 
glets, SU(2) doublets, additional massive U(l) boson, etc. However, adding new physics increases 
the number of parameters (reduces n^f). Therefore, unless the reduction in A q is very significant (to 
A q < 10 or so), the total Xw/ n df can actually increase due to the reduced ndf and to the mild (log) 
dependence of xl on A q . Of all models discussed (MSSM or NMSSM-like) only the GNMSSM 
model had A q < 10; this model emerges as the one with the best chance of a good fit (Xw/ n df) in 
the sense discussed above. 

• Can one compare the relative viability of two models based on the values of A q alone? 

The answer is negative. It is the total Xw/ n df that matters when comparing models, not A q alone. 
This is because one model can have a smaller A q (by adding more parameters) but a larger Xw/ n df 
than in the second model (see previous point). 

To conclude, naturalness in SUSY models is an intrinsic part of the likelihood to fit the exper- 
imental data that includes the EW scale. This helps address many open questions on this issue, as 
shown above. The results above indicate that for some popular SUSY models the value obtained 
for the corrected likelihood (i.e. Xw/ n df) already questions their viability, except perhaps the case 
of GNMSSM. This can be investigated further by performing accurate data fits that evaluate the 
new Xw/ n df that we identified. 
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